Goto

Collaborating Authors

 repair operation


Graph Repairs with Large Language Models: An Empirical Study

arXiv.org Artificial Intelligence

Property graphs are widely used in domains such as healthcare, finance, and social networks, but they often contain errors due to inconsistencies, missing data, or schema violations. Traditional rule-based and heuristic-driven graph repair methods are limited in their adaptability as they need to be tailored for each dataset. On the other hand, interactive human-in-the-loop approaches may become infeasible when dealing with large graphs, as the cost--both in terms of time and effort--of involving users becomes too high. Recent advancements in Large Language Models (LLMs) present new opportunities for automated graph repair by leveraging contextual reasoning and their access to real-world knowledge. We evaluate the effectiveness of six open-source LLMs in repairing property graphs. We assess repair quality, computational cost, and model-specific performance. Our experiments show that LLMs have the potential to detect and correct errors, with varying degrees of accuracy and efficiency. We discuss the strengths, limitations, and challenges of LLM-driven graph repair and outline future research directions for improving scalability and interpretability.


A Novel Community Detection Based Genetic Algorithm for Feature Selection

arXiv.org Machine Learning

The selection of features is an essential data preprocessing stage in data mining. The core principle of feature selection seems to be to pick a subset of possible features by excluding features with almost no predictive information as well as highly associated redundant features. In the past several years, a variety of meta-heuristic methods were introduced to eliminate redundant and irrelevant features as much as possible from high-dimensional datasets. Among the main disadvantages of present meta-heuristic based approaches is that they are often neglecting the correlation between a set of selected features. In this article, for the purpose of feature selection, the authors propose a genetic algorithm based on community detection, which functions in three steps. The feature similarities are calculated in the first step. The features are classified by community detection algorithms into clusters throughout the second step. In the third step, features are picked by a genetic algorithm with a new community-based repair operation. Nine benchmark classification problems were analyzed in terms of the performance of the presented approach. Also, the authors have compared the efficiency of the proposed approach with the findings from four available algorithms for feature selection. The findings indicate that the new approach continuously yields improved classification accuracy.


Repair and Prediction (under Inconsistency) in Large Biological Networks with Answer Set Programming

AAAI Conferences

We address the problem of repairing large-scale biological networks and corresponding yet often discrepant measurements in order to predict unobserved variations. To this end, we propose a range of different operations for altering experimental data and/or a biological network in order to re-establish their mutual consistency-an indispensable prerequisite for automated prediction. For accomplishing repair and prediction, we take advantage of the distinguished modeling and reasoning capacities of Answer Set Programming. We validate our framework by an empirical study on the widely investigated organism Escherichia coli.


Development of Self-Maintenance Photocopiers

AI Magazine

The traditional reliability design methods are imperfect because the designed systems aim at fewer faults, but once a fault happens, the systems might hard fail. To solve this problem, we present a self-maintenance machine (SMM), one that can maintain its functions flexibly even though faults occur. To achieve the capabilities of diagnosing and repair planning, a model-based approach that uses qualitative physics was proposed. Regarding the repair-executing capability, control-type repair strategy was followed. A prototype of the SMM was developed, and it succeeded in maintaining its functions if the structure did not change. However, the prototype revealed the following problems when its reasoning system was used with a commercial product as embedded software: (1) poor performance of the reasoning system, (2) system size that was too large, (3) low adaptability to environmental changes, and (4) roughness of qualitative repair operations. To solve these problems, we proposed new reasoning method based on virtual cases and fuzzy qualitative values. This methodology is one of knowledge compilation, which gives better reasoning performance and can deal with real-world applications such as the SMM. By using this method, we finally developed a commercial photocopier that has self-maintainability and is more robust against faults. The commercial version has been supplied worldwide as a product of Mita Industrial Co., Ltd., since April 1994.